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Claims 



A method of processing free-format data stored in 
a computing system, comprising the steps of examining 
elements pf the data to determine attributes of the data, 
by examining the content of the elements and the contextual 
relationships of elements to each other, to determine 
semantic and syntactic information (attributes) about the 
data, producing additional data relating to this 
information, jS^n the form of a text object which includes 
pointer means fabling access to the elements of the free- 
format data, anckthe additional data being accessible by a 
query processing means to provide answers to queries 
relating to the semantic and syntactic information about 
the data and/or to access the data to manipulate the data. 



2 . A method irf 
free-format data is IstJsfrec 
field of a database 

3 . A method i 
wherein the data remains 
it was originally stored, 
other applications- \ 

4 . A method in 



jrdance with claim 1, wherein the 
as a record in a free-format 

iorc^ance with claim 1 or claim 2, 
d in the computing system as 
by it may be accessed by 



35 



accordance with any preceding claim, 
wherein the text object inclucfles an attribute - type 
identifier which identifies an \ttribute type of an element 
of the data. 

5. A method in accordance vOdth any preceding claim, 
wherein the text object includes a Value indicating the 
character length of an element of thet data. 

6. A method in accordance with\claim 4 or claim 5, 
wherein the text object includes a valute indicating whether 
an element is low level in a syntactic hierarchy or higher 
level whereby the value may be used for matching purposes 
when matching data with other data processed in accordance 
with the method. 

7 . A method in accordance with any preceding claim, 
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the text object including a match weighting value for an 
elemeW of the data, which can be used to determine the 
significance of the element when matching with other free 
format ofeta. 

5 8. \A method in accordance with any preceding claim, 

wherein the\text object comprises a plurality of component 
nodes arranged according to the semantic structure of the 
free-format dad:a, the component nodes being arranged in a 
hierarchy corresponding to the semantic structure of the 
10 free-format dataXand each component node including 

additional data relating to the corresponding element of 
the free-format datV. 

9. A method iX accordance with any preceding claim, 
comprising the furtheAstep of generating matching values 

15 for comparing an element of the free-format data with an 
element of other f ree-f orfcna£Ndata processed in accordance 
with the present methoaLN, A \ 

10. A method in accirQapeJa with claim 9 where the 
matching value is a phone jbrc Values for phonetically 

20 comparing elements of f r^fe s-f ori^at Qata . 

11. A method in acaordanca wim any preceding claim, 
wherein the text object includes implied data relating to 
information implied from the free-format data. 

12. A method in accordance witk any preceding claim, 
2 5 wherein a plurality of free-format daua records are 

processed and a text object associated Wth each 
free-format data record is produced. \ 

13. A method in accordance with claW 12, wherein the 
text object is stored in the computer system whereby it is 

30 available for queries on the associated freeVformat data 
record via the query processing means. \ 

14. A method in accordance with claim 12\comprising 
the further step of producing a text object indefc including 
attribute type identifiers for elements of each da±a record 

35 and pointers to each data record, whereby the index\may be 
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queried by queries relating to semantic and syntactic 
information about the data and the data may be accessed via 
the indej 

15. ^ method in accordance with claim 14 wherein each 
entry in th^ text object index includes a representative 
value key, which gives a value representative of a feature 
of the elemen^ associated with the attribute - type 
identifier . 

16. A methW in accordance with any preceding claim, 
comprising the further step of carrying out a domain 
construction procesas to construct a domain object from 
domain definition da\a files, the domain object being 
arranged to carry outVthe examination process by parsing 
the free-format data in accordance with grammar rules. 

17. A method in accordance with claim 16, wherein the 



domain definition data 
data, regular expressi<|> 

18 . A method in 
wherein the free-format 

19. A method in accoj 
wherein the query processing me 1 
database operations on the data 



include character definition 
tion data and grammar data, 
with any preceding claim, 
postal address data, 
with any preceding claim 
can carry out normal 
the additional data . 



35 



20. A processing system for\ processing free-format 
data stored in a computing system, Ythe apparatus including 
means for examining elements of theNdata to determine 
attributes of the data, by examining Nthe content of the 
elements and the contextual relationships of elements to 
each other, to determine semantic and syntactic information 
(attributes) about the data, means for producing additional 
data relating to this information, in the f^rm of a text 
object which includes pointer means enabling Recess to the 
elements of the free-format data, and a query processing 
means which is arranged to access the additional\data to 
provide answers to queries relating to the semantic and 
syntactic information about the data and/or to access the 
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data to\ manipulate the data, 

21. \ A processing system in accordance with claim 20, 
wherein tl^e free-format data is stored as a record in a 
free-formak field of a database. 

22. A\ processing system in accordance with claim 20 
or claim 2lA wherein the examining means does not affect 
the storage at the data. 

23. A prc&cessing system in accordance with any one of 
claims 20 to 22 A wherein the text object includes an 
attribute - type \dentifier which identifies an attribute 
type of an element \of the data. 

24. A processing system in accordance with any one of 
claims 20 to 23, wherein the text object includes a value 
indicating the character length of an element of the data. 

25. A processing\system in accordance with claim 23 



or claim 24, wherein th 
indicating whether an 
level in a syntactic hierk 
value may be used for mate 
other free-format data pro 
system. 

26. A processing sy 
claims 20 to 25, wherein 



ject includes a value, 

type of an element is low 
fhy or\high level whereby the 

)oses when matching with 
fed ii\ accordance with this 



tern i\i accordance with any one of 
:he texk object includes a match 
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weighting value for an element ofVthe data, which can be 
used to determine the significance V>f the element when 
matching with other free-format data^ 

27. A processing system in acco^sdance with any one of 
claims 20 to 26, wherein the text objec\ comprises a 
plurality of component nodes arranged according to the 
semantic structure of the free-format data\ the component 
nodes being arranged in a hierarchy corresponding to the 
semantic structure of the free-format data, a\d each 
component node including additional data relating to the 
corresponding element of free-format data. 

28. A processing system in accordance with ahy one of 
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claimsV20 to 27, the text object means for generating 
matching, values for comparing an element of the free-format 
data witnVan element of other free-format data processed by 
the processing system. 

29. A\processing system in accordance with claim 28, 
wherein the matching value is a phonetic value for 
phonetically Comparing elements of free-format data. 

30. A processing system in accordance with any one of 
claims 20 to 29,\wherein the text object includes implied 
data relating to \nformation implied from the free-format 
data . 

31. A processing system in accordance with any one of 
claims 20 to 30, wherein the system is arranged to process 

a plurality of free-fcvrmat data records and produce a text 
object associated with each free-format data record. 

32. A processing s^s^&p in accordance with claim 31, 



)rouuciag 



idex* 
f ofVeaci 



lerem 



lg to\the 
and/c 



additional data is arranged 
luding attribute - type 
data record and pointers 
he query processing means 
index to provide 
emantic and syntactic 
o\ access the data to 
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wherein the means for 
to produce a text objec 
identifiers for elementV 
to each data record and 
is arranged to access thfe tex-fec ob\ect 
answers to queries relati 
information about the da1 
manipulate the data. 

33. A processing system in accordance with claim 32, 
wherein the text object index includes representative value 
keys for entries, which give a value Nrepresentative of a 
feature of the element associated withN^the attribute - type 
identifier for the entry for facilitating matching with 
other free-format data processed in accora^nce with this 
system. 

34. A processing system in accordance v)ith any one of 
claims 20 to 33, further comprising a domain oogect, the 
domain object being arranged to carry out the examination 
process by parsing the free-format data in accordance with 
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gramman rules. 

35 \ A processing system in accordance with claim 34, 
wherein the domain object is produced by a domain 
construction process from domain definition data files. 

36. >A processing system in accordance with claim 35, 
further comprising a domain constructor for carrying out 
the domain construction process, 

37. A processing system in accordance with claim 35 
or claim 36, wherein the domain definition data files 
include character definition data, regular expression 
definition data arid grammar data. 

38. A processing system in accordance with any one of 
claims 20 to 37, wherein the free-format data is postal 
address data . 

39. A processing system in accordance with any one of 



claims 20 to 38 , where 
arranged to carry out 
data via the additiona 

40 . A method of 
stored in a computing 
free- format data reco 
additional data re 
information (attribut 
record, the additiona 
object associated witl\ 



slat Lng 



ial> 
lata . 



query processing means is 
database operations on the 



;na)blini 



iompj 



3 em. 



access to free-format data 
ncluding a plurality of 
sing the steps of storing 
ntic and syntactic 
e data for each data 

in the form of a text 
record, the text object 
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tO VS€ 

is) aboi 
data b&in 
each dat 

including pointer means enablind access to elements of each 
free-format data record, the add^ional data being 
accessible by a query processing nteans to provide answers 
to queries relating to the semanticVand syntactic 
information about the data and/or to\access the data to 
manipulate the data. 

41. A processing system for enabling access to 
free-format data stored in a computing sVstem, including a 
plurality of free-format data records, they processing 
system comprising additional data relating \o semantic and 
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syntactic information (attributes) about the data for each 
data isecord, stored and accessible by the processing 
systemX the additional data being in the form of a text 
object associated with each data record, the , text object 
includingNpointer means enabling access to elements of each 
f ree-formaA data record, and a query processing means 
arranged to Naccess the additional data to provide answers 
to queries reYating to the semantic and syntactic 
information abi&ut the data and/or to access the data to 
manipulate the data. 

42. A method of enabling access to free-format data 
stored in a computing system, including a plurality of 
free-format data records, comprising the steps of storing 
additional data relating tc/ semantic and syntactic 



information (attribute 
record, the additional 
object index which in 
for elements of each da 
record, the text object 



?out \t 



data of each data 
in the form of a text 
bute - type identifiers 
nd pointers to each data 
accessible by a query 
nswers' to queries relating to 



is attr 
record 



index bein 
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processing means to provide 
the semantic and syntactic information about the data 
and/or to access the data to manipulate the data. 

43. A processing system fori enabling access to 
free-format data stored in a computing system, including a 
plurality of free-format data records, the processing 
system comprising the additional data\ relating to semantic 
and syntactic information (attributes) \about the 
free-format data for each data record, r^ie additional data 
being in the form of a text object index Vhich includes 
attribute type identifiers for elements of\each data record 
and pointers to each data record, and a query processing 
means arranged to access the additional data \o provide 
answers to queries relating to the semantic ana syntactic 
information about the data and/or to access the Vata to 
manipulate the data. 
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44. \A method of accessing free-format data processed 
in accordance with the method of any one of claims 1 to 19 
comprising! the steps of accessing the additional data to 
provide answers to queries relating to the semantic and 

5 syntactic information about the data and/or to access the 
data to manipulate the data. 

45. A processing system for enabling access to 
free-format data processed in accordance with the method of 
any one of claima 1 to 19, the processing system including 

10 a query processing means arranged to access the additional 
data and provide answers to queries relating to the 
semantic and syntactic information about the data and/or to 
access the data to manipulate the data. 

46. A processing system for processing free-format 
15 data stored in a computing system, comprising means for 

examining elements c f the d\ta to determine attributes of 
the data, by examining /trie cprftent of the elements and the 
contextual relations hig^d^ eleVents to each other, to 
determine semantic aibd syntactics information (attributes) 
20 about the data, and a [ quer A processing means for utilising 
this information to provide answers to queries relating to 
the semantic and syntactic information about the data 
and/or to access the data. \ 

47. A processing system in accordance with claim 46, 
25 wherein the examining means retaYns the free-format data as 

stored in the computer system, without affecting it. 

48. A method of processing roree-f ormat data stored in 
a computing system, comprising the \steps of examining 
elements of the data to determine attributes of the data, 

30 by examining the content of the elements and the contextual 
relationships of elements to each otheV, to determine 
semantic and syntactic information (attributes) about the 
data, and querying the data using this information to 
provide answers to queries relating to the\semantic and 

35 syntactic information about the data and/or \o access the 



data. \ 

4 9.\ A method of processing free-format data in 
accordance with claim 48, wherein the free-format data is 
unaf f ecteov by the examining process and remains stored in 
5 the computing system as it was originally stored. 

50. A computer readable memory storing instructions 
for controlling: a computer to process free-format data 
stored in a computing system, in accordance with the method 
of any one of claims 1 to 19. 
10 51. A computer readable memory storing instructions 

for controlling a computer to process free-format data 
stored in a computing\system, in accordance with the method 
of claim 48 . \ 

52. A method of processing a plurality of records of 
15 free-format data stored iV a computing system, comprising 

the steps of, for each reco|rck examining elements of the 
data to determine attributis\o\ the data, by examining the 
content of the elements and tW\contextual relationships of 
elements to each other, toL^et^rraLne semantic and syntactic 

20 information (attributes) about ^ac\ record, and producing 
virtual data fields associated with\ach record enabling 
access to this information and the\ associated elements, 
whereby each record is provided witk associated virtual 
data fields enabling access to semancsic and syntactic 

25 information about that record and alsoVaccess to the 
associated elements. \ 

53. A processing system for processing free-format 
data records stored in a computing systemA comprising means 
for examining elements of the data of each Vecord to 

30 determine attributes of the data, by examining the content 
of the elements and the contextual relationship of elements 
to each other, to determine semantic and syntactic 
information (attributes) about the data, and meai^s for 
producing virtual data fields associated with each, record 

35 enabling access to this information and the associated 
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